Detecting Filled Pauses in Tutorial Dialogs

نویسندگان

  • Gaurav Garg
  • Nigel Ward
چکیده

As dialog systems become more capable, users tend to talk more spontaneously and less formally. Spontaneous speech includes features which convey information about the user’s state. In particular, filled pauses, such as um and uh, can indicate that the user is having trouble, wants more time, wants to hold the floor, or is uncertain. In this paper we present a first study of the acoustic characteristics of filled pauses in tutorial dialogs. We show that in this domain, as in other domains, filled pauses typically have flat pitch and fairly constant energy. We present a simple algorithm based on these features which detects filled pauses with 80% coverage and 67% accuracy. Analysis of the prediction failures shows that some are due to filled pauses of unusual types and related phenomena: filled pauses marking a change of state, cases where uncertainty is marked by lengthening a vowel in a word, and filled pauses which seque directly into a word. also with the Indian Institute of Technology, Kharagpur This research was sponsored in part by National Science Foundation Grant No. 0415150.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Real-time System Detecting Filled Pauses in Spontaneous Speech

This paper describes a method for detecting filled pauses (including word lengthening), which are one of the hesitation phenomena. This detection is important in speech dialogue systems because they play valuable roles in oral communication. Although there have been a few previous speech recognition systems handling filled pauses, they have not detected them individually and consequently could ...

متن کامل

Detecting laughter and filled pauses using syllable-based features

Identifying laughter and filled pauses is important to understanding spontaneous human speech. These are two common vocal expressions that are non-lexical and incredibly communicative. In this paper, we use a two-tiered system for identifying laughter and filled pauses. We first generate frame level hypotheses and subsequently rescore these based on features derived from acoustic syllable segme...

متن کامل

A real-time filled pause detection system for spontaneous speech recognition

This paper describes a method for automatically detecting filled (vocalized) pauses, which are one of the hesitation phenomena that current speech recognizers typically cannot handle. The detection of these pauses is important in spontaneous speech dialogue systems because they play valuable roles, such as helping a speaker keep a conversational turn, in oral communication. Although a few speec...

متن کامل

A Hybrid Model for Tutorial Dialogs

Until recently, rigid and sometimes cumbersome structures, which underly dialog patterns considered manageable for achieving a given task in a controlled manner, proved to be a serious weakness of interactive systems. Through the introduction of the information state as a representation to control the evolving state of a dialog, substantial improvements were obtained, with elaborations made for...

متن کامل

Semantic analysis in a robust spoken dialog system

In this paper we describe the semantic interpretation process of utterances in a spoken dialog system for train table inquiries. Spoken dialogs show a large set of problems in human–machine–communication like stops, corrections, filled pauses, non grammatical sentences, ellipses, unconnected phrases etc. In our robust approach we are able to handle a substantial amount of them. While the princi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006